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Abstract — The capacity of multiuser networks has been a long- 
standing problem in information theory. Recently, Avestimehr et 
al. have proposed a deterministic network model to approximate 
multiuser wireless networks. This model, known as the ADT 
network model, takes into account the broadcast nature of 
wireless medium and interference. 

We show that the ADT network model can be described within 
the algebraic network coding framework introduced by Koetter 
and Medard. We prove that the ADT network problem can be 
captured by a single matrix, and show that the min-cut of an 
ADT network is the rank of this matrix; thus, eliminating the 
need to optimize over exponential number of cuts between two 
nodes to compute the min-cut of an ADT network. We extend the 
capacity characterization for ADT networks to a more general 
set of connections, including single unicast/multicast connection 
and non-multicast connections such as multiple multicast, disjoint 
multicast, and two-level multicast. We also provide sufficiency 
conditions for achievability in ADT networks for any general 
connection set. In addition, we show that random linear network 
coding, a randomized distributed algorithm for network code 
construction, achieves the capacity for the connections listed 
above. Furthermore, we extend the ADT networks to those with 
random erasures and cycles (thus, allowing bi-directional links). 

In addition, we propose an efficient linear code construction 
for the deterministic wireless multicast relay network model. 
Note that Avestimehr et a/.'s proposed code construction is not 
guaranteed to be efficient and may potentially involve an infinite 
block length. Unlike several previous coding schemes, we do 
not attempt to find flows in the network. Instead, for a layered 
network, we maintain an invariant where it is required that at 
each stage of the code construction, certain sets of codewords 
are linearly independent. 

Index Terms — Network Coding, Deterministic Network, Alge- 
braic Coding, Multicast, Non-multicast, Code Construction 



I. Introduction 

Finding the capacity as well as the code construction for 
the multi-user wireless networks are generally open problems. 
Even the relatively simple relay network with one source, 
one sink, and one relay, has not been fully characterized. 
There are two sources of disturbances in multi-user wireless 
networks - channel noise and interference among users in 
the network. In order to better approximate the Gaussian 
multi-user wireless networks, [fl][]2] proposed a binary linear 
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deterministic network model (known as the ADT model), 
which takes into account the multi-user interference but not the 
noise. A node within the network receives the bit if the signal 
is above the noise level; multiple bits that simultaneously 
arrive at a node are superposed. 

References IT)(|2] showed that, for a multicast connection 
where a single source wishes to transmit the same data to a 
set of destinations, the achievable rate is equal to the minimal 
cut between the source and any of the destinations. Note that 
min-cut of an ADT network may not equal to the graph 
theoretical cut value, as we shall discuss in Section [V] In 
addition, they showed that the minimal cut between the source 
and a destination is equal to the minimal rank of incidence 
matrices of all cuts between the two nodes. This can be 
viewed as the equivalent of the Min-cut Max-flow criterion 
in the network coding for wireline networks ||3]|3]. It has 
been shown that for several networks, the gap between the 
capacity of the deterministic ADT model and that of the 
corresponding Gaussian network is bounded by a constant 
number of bits, which does not depend on the specific channel 
fading parameters JT] J5] J6) . 

In this paper, we make a connection between the ADT 
network and network coding - in particular, algebraic network 
coding introduced by Koetter and Medard J4(. This paper is 
based on work from [7|[8|[9|. Other approaches to operations 
in high SNR networks have been proposed [10], however, 
we do not compare these different approaches but build upon 
the given model proposed by HI lE) . We show that the ADT 
network problems, including that of computing the min-cut 
and constructing a code, can be captured by the algebraic 
network coding framework. 

In the context of network coding, (4) showed that the 
solvability of the communication problem [3] is equivalent to 
ensuring that a certain polynomial does not evaluate to zero 
- i.e. avoid the roots of this certain polynomial. Furthermore, 
[4| showed that there are only a fixed finite number of roots of 
the polynomial; thus, with large enough field size, decodability 
can be guaranteed even under randomized coding schemes as 
shown in ifTTI . As we increase the field size F q , the space 
of feasible network codes increases exponentially; while the 
number of roots remain fixed. 

We show that the solvability of ADT network problem can 
be characterized in a similar manner. The important difference 
between the algebraic network coding in |4] and the ADT 
network is that the broadcast as well as the interference 
constraints are embedded in the ADT network. Note that the 
interference constraint, represented by the additive multiple 
access channels (MAC), can be easily incorporated into the 
algebraic framework in [4] by pre-encoding at the transmitting 
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nodes (i.e. MAC users). This is due to the fact that the MAC is 
modeled using finite field additive channel; thus, the operations 
performed by the MAC can be "canceled" by the transmitter 
appropriately pre-encoding the packets. 

On the other hand, the broadcast constraint may seem more 
difficult to incorporate, as the same code affects the outputs 
of the broadcast channel simultaneously, and the dependencies 
propagate through the network. Thus, in essence, this paper 
shows that this broadcast constraint is not problematic. 

To briefly describe the intuition, consider an ADT network 
without the broadcast constraint - i.e. the broadcast edges do 
not need to carry the same information. Using this "uncon- 
strained" version of the ADT network, the algebraic frame- 
work in 0I can be applied directly; thus, there is only a finite 
fixed number of roots that need to be avoided. Furthermore, as 
the field size increases, the probability of randomly selecting 
a root approaches zero. Now, we "re-apply" the broadcast 
constraints to this unconstrained ADT network. The broadcast 
constraint fixes the codes of the broadcast edges to be the 
same; this is equivalent to intersecting the space of network 
coding solutions with an hyperplane, which enforces the output 
ports of the broadcast to carry the same code. As shown 
in Figure Q] this operations does not change the polynomial 
whose root we have to avoid, but changes the hyperspace 
we operate in. As a result, this operation does not affect 
the roots of the polynomial; thus, there are still only a fixed 
finite number of roots that need to avoided, and with high 
enough field sizes, the probability of randomly selecting a root 
approaches zero. Note that intersecting the space of network 
coding solutions with an hyperplane may even "remove" some 
roots of the polynomial from consideration; therefore, we may 
effectively have fewer roots to avoid. By the same argument 
as ffl lfTTll . we can then show that the solvability of an ADT 
network problem is equivalent to ensuring that a certain 
polynomial does not evaluate to zero within the space defined 
by the polynomial and the broadcast constraint hyperplane. 
As a result, we can describe the ADT network within the 
algebraic network coding framework and extend the random 
linear network coding results to the ADT networks. 

Using this insight, we prove that the ADT network problem 
can be captured by a single matrix, called the system matrix. 
We show that the min-cut of an ADT network is the rank of 
the system matrix; thus, eliminating the need to optimize over 
exponential number of cuts between two nodes to compute 
the min-cut of an ADT network. We extend the capacity 
characterization for ADT networks to a more general set of 
connections, including single unicast/multicast connection and 
non-multicast connections such as multiple multicast, disjoint 
multicast, and two-level multicast. We also provide sufficiency 
conditions for achievability in ADT networks for any general 
connection set. Furthermore, we extend the results on ADT 
networks to those with random erasures and cycles (thus, 
allowing bi-directional links). 

We show that a direct consequence of this connection 
between ADT network problems and algebraic network coding 
is that random linear network coding, a randomized distributed 
algorithm for network code construction, achieves the capacity 
for the connections listed above. However, random linear 
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Fig. 1: A polynomial f(x) and a hyperplane g(x) with one 
variable x G R. The black dots represent the roots of f(x). 
When considering the space where f(x) intersects g(x), the 
hyperplane g(x) limits the space in which f(x) operates in; 
however, does not change the roots of f(x). Some of the 
roots of f(x) may no longer be "feasible" given the additional 
constraint; thus, this operation may reduce the number of roots 
that we have to consider. 



network coding does not guarantee decodability; it allows 
decodability at all destinations with high probability. 

Therefore, we propose an efficient linear code construction 
for multicasting in ADT networks that guarantees decodabil- 
ity, if such code exists. Note that Avestimehr et ai's proposed 
code construction is not guaranteed to be efficient and may 
potentially involve an infinite block length. Unlike several 
previous coding schemes lfr2llfT3llfT4l . we do not attempt to 
find flows in the network. Instead, for a layered network, we 
maintain an invariant where it is required that at each stage of 
the code construction, certain sets of codewords are linearly 
independent. We assume that any node in the network can 
potentially be a destination. We design the code such that if 
the min-cut from the source to a certain node is at least the 
required rate, then the node will be able to reconstruct the 
data of the source using matrix inversion. In addition, when 
normalized by the number of sinks, our code construction has 
a complexity which is comparable to those of previous coding 
schemes for a single sink. 

Our construction can be viewed as a non-straightforward 
generalization of the algorithm in lfl5l for the construction 
of linear codes for multicast wireline networks. Each sink 
receives on its incoming edges a linear transformation of the 
source. The generalization of the code construction to the ADT 
network model is not straightforward, due to the broadcast 
constraint and the interference constraint, which are embedded 
into the ADT network model. 

The paper is organized as follows. We present the network 
model in Section Hill and an algebraic formulation of the ADT 
network in Section |IV] Using this algebraic formulation, we 
provide a definition of the min-cut in ADT networks in Section 
IVl In Sections [VI] we restate the Min-cut Max-flow theorem 
using our algebraic formulation, and present new capacity 
characterizations for ADT networks to a more general set 
of traffic requirements in Section IVIII The results in Section 
I VIII show the optimality of linear operations for non-multicast 
connections such as disjoint multicast and two-level multicast 
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connections. In Section IVIII1 we study ADT networks with 
link failures, and characterize the set of link failures such that 
the network solution is guaranteed to remain successful. Fur- 
thermore, in Sections |IX] we extend the achievability results to 
ADT networks with delay. In Section [Xj we present our code 
construction algorithm for multicasting in ADT networks, and 
analyze its performance. Finally, we conclude in Section [Xl] 

II. Background 

Avestimeher et al. introduced the ADT network model to 
better approximate wireless networks JT]|]2]. In the same work, 
they characterized the capacity of the ADT networks, and 
generalized the Min-cut Max-flow theorem for graphs to ADT 
networks for single unicast/multicast connections. 

It has been shown that for several networks, the ADT 
network model approximates the capacity of the corresponding 
Gaussian network to within a constant number of bits. For 
instance, (H considered the single relay channel and the dia- 
mond network, and showed that the gap between the capacity 
of the ADT model and that of Gaussian network is within 1 
bit and 2 bits, respectively. Reference considered many- 
to-one and one-to-many Gaussian interference networks. The 
networks in are special cases of interference network with 
multiple users, where the interference are either experienced 
(many-to-one) or caused by (one-to-many) a single user. It was 
shown that in these cases, the gap between the capacity of the 
Gaussian interference channel and the corresponding deter- 
ministic interference channel is again bounded by a constant 
number of bits. The work in provided an alternative proof 
to lfl6l on the existence of a scheme that can achieve a constant 
gap from the capacity for all values of channel parameters. 
In |6|, the half-duplex butterfly network was considered. 
They showed that the deterministic model approximates the 
symmetric Gaussian butterfly network to within a constant. 

As a result, there has been significant interest in finding 
an efficient code construction algorithm for the ADT network 
model. In the case of unicast communication, a number of 
previous code constructions have been proposed for wireless 
relay networks. It is important to observe that in the code 
constructions for unicast communication, routing ifPJI or one- 
bit operations [12] are sufficient for achieving the capacity of 
the deterministic model. Amaudruz and Fragouli [17 1 proposed 
an algorithm which can be viewed as an application of the 
Ford and Fulkerson flow construction to the deterministic 
model. The complexity of the algorithm was shown to be 
0(\V\ \£ \R 5 ), where V is the set of nodes in the network, 
£ is the set of edges, and R is the rate of the code. In 0~3), 
another algorithm for finding the flow for unicast networks 
was developed. The algorithm is based on an extension of the 
Rado-Hall transversal theorem for matroids and on Edmonds' 
theorem. The transmission scheme in iTPJI extracts at each 
relay node a subset of the input vectors and sets the outputs 
to the same values as that subset. In fl4l . it was shown that the 
deterministic model can be viewed as a special case of a more 
abstract flow model, called linking network, which is based 
on linking systems and matroids. Using this approach, lfl4l 
achieved a code complexity 0(XNj i r log Ni ayer ), where A 




Fig. 2: Example network for a non-binary code. 



is the number of layers in the layered network, and Ni ayer 
is the maximal number of nodes in a layer. Note that linear 
network coding is known to be matroidal [18]; thus, the fact 
that ADT networks are matroidal Ifl4l is consistent with our 
result. 

In the case for multicast communication, however, routing 
or one-bit operations may not be sufficient to achieve the 
capacity in the ADT model. This can be shown by considering 
the example in Figure [2] which is given in lfl9l fl20l ETI 
for network coding. From the analysis for network coding, 
it follows that in the case of the deterministic model, the 
maximal rate 2 can be achieved simultaneously for all sinks 
only with an alphabet size which is at least 3. To see this, 
observe that to achieve rate 2 the source has to transmit at 
its outputs two statistically independent symbols x\,X2- For 
node vf, 1 < i < 4 at the second layer, the transmitted 
symbol is a certain function of the symbols Xi,X2, given 
by Hi = fi(%i,%2)- Node vf,l < i < 4 at the third 
layer transmits at its outputs two functions of j/j, given by 
fi(Vi)' fi(Ui)- Sink tj,l < i < 6 receives at its two 
inputs symbols of the form fj(yj), ff{Uj) + ftiVk) for some 
1 < J < 4, 1 < k < 4, j k. It follows that without rate loss, 
we can always assume fj (yj ) = yj for each 1 < j < 4. In that 
case, the sink i; receives yj at its upper input and can therefore 
find fj (yj ) and eliminate it from its second received symbol. 
Thus, it is equivalent to the situation in which the sink receives 
yj,yk- This in turn is exactly the situation in ll20l (Theorem 
3.1) for network coding. Since the channels are all binary in 
the deterministic model, it follows that the minimal required 
alphabet size is in fact 2 2 = 4, and therefore the minimal 
vector length is log 2 (4) = 2. Thus, for multicasting in ADT 
networks, we need to either operate in a higher field size, ¥ q , 
q > 2, or use vector coding (or both). 

References 12211231 . independently from [7|[8 |[9|, proposed 
a polynomial time algorithm for multicasting in ADT net- 
works. In particular, 11231 extended the algebraic network 
coding result [4] to vector network coding, and showed 
that constructing a valid vector code is equivalent to certain 
algebraic conditions. This result 11231 is supported by the result 
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Fig. 3: Additive MAC with two users, and the corresponding 
rate region. The triangular region is modeled as a set of finite 
field additive MACs. 



from 1241 . Reference 11241 introduced network codes, called 
permute-and-add, that only require bit-wise vector operations 
to take advantage of low-complexity operations in F2. In 



addition, [ 24] showed that codes in higher field size W q can be 
mapped to binary-vector codes without loss in performance. 
This insight, combined with that of |]4], suggests that an 
algebraic property of a scalar code may translate into another 
algebraic property of the corresponding vector code. 

III. Network Model 

As in (T]|2], we shall consider the high SNR regime, in 
which interference is the dominating factor. In high SNR, 
analog network coding, which allows/encourages strategic 
interference, is near optimal ifTO) . Analog network coding 
is a physical layer coding technique, introduced by ||25l , in 
which intermediate nodes amplify-and-forward the received 
signals without decoding. Thus, the nodes amplify not only 
the superposed signals from different transmitters but also the 
noise. Note that a network operating in high SNR regime is 
different from a network with high gain since a large gain 
amplifies the noise as well as the signal. 

In the high SNR regime, the Cover- Wyner region may be 
well approximated by the combination of two regions, one 
square and one triangular, as in Figure [3] The square (shaded) 
part can be modeled as parallel links for the users, since 
they do not interfere. The triangular (unshaded) part can be 
considered as that of a time-division multiplexing (TDM), 
which is equivalent to using noiseless finite-field additive 
MAC [26|. This result holds not only for binary field additive 
MAC, but also for higher field size additive MAC (26). 

The ADT network model uses binary channels, and thus, 
binary additive MACs are used to model interference. Prior 
to ODE), Effros et al. presented an additive MAC over a 



finite field F g |27|- The Min-cut Max-flow theorem holds for 
all of the cases above. It may seem that the ADT network 
model differs greatly from that of [27 1 owing to the difference 
in field sizes used. In general, codes in F q subsume binary 
codes, i.e. binary-vector codes in (F2)'". However, for point- 
to-point links with memory (or equivalently by allowing nodes 
to code across time), we can convert a code in (F2)" 1 to a 
code in higher field size F q and vice versa by normalizing to 
an appropriate time unit. Note that ADT network model uses 
binary additive MACs and point-to-point links. Therefore, our 
work in part shows an equivalence of higher field size codes 
and binary-vector codes in ADT networks. 
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Fig. 4: Example network. We omit I(S) and 0{T) in this 
diagram as they do not participate in the communication. 
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Fig. 5: A supernode V. 



As noted in Section H2 |24 | presented a method of convert- 
ing between binary-vector codes and higher field size codes. 
We can achieve a higher field size in ADT networks by 
combining multiple binary channels. In other words, consider 
two nodes Vi and V 2 with two binary channels connecting 
Vi to V 2 . Now, instead of considering them as two binary 
channels, we can "combine" the two channels as one with 
capacity of 2-bits. In this case, instead of using F2, we can 
use a larger field size of F4. Thus, selecting a larger field size 
F 9 , q > 2 in ADT network results in fewer but higher capacity 
parallel channels. Reference [24] also provides a conversion 
from a code in a higher field ¥ q to a binary-vector scheme 
in F 2 ™ where q < 2 m . Therefore, a solution in ¥ q may be 
converted back to a binary-vector scheme, which may be more 
appropriate for the original ADT model. Furthermore, it is 
known that to achieve capacity for multicast connections, F2 
is not sufficient [28|; thus, we need to operate in a higher field 
size. Therefore, we shall not restrict ourselves to F2. 

We now proceed to defining the network model precisely. 
A wireless network is modeled using a directed graph G = 
(V, £) with a supernode set V and an edge set £ , as shown 



Y(e 1 ) = a (hei) X(S,l) + a (2iei) X(S,2) 
Y(e 2 ) = a ihe2) X(S, 1) + a (2 , e2) X(S, 2) 
Y(e 3 ) = Y(e 6 ) = Y{e x ) 
Y(e 4 ) = Y(e 2 ) 
Y(e 5 ) = Y(e 8 ) = 

Y(e 4 ) 

Y(e 9 ) = y(eu) = P ie6 ,e 9 )Y(e 6 ) 
Y(e w ) = P(e 6 , ew )Y(e 6 ) 
Y(e 12 ) = Y(e 7 ) + Y(e 10 ) 

Z ( T , 1) = e( eil ,(T,i))^(eii) + £( ei2 ,(T,l))^(ei2) 
Z(T, 2) = e (eili ( Tj 2))y(e u ) + e {ei2 ^ 2)) Y(e 12 ) 

Fig. 6: Equations relating the various processes of Figure 2] 
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Fig. 7: A new interpretation of the example network from Figure [4] The broadcast channel is modeled using an hyperedge. As 
a result, an output port's decision to transmit or not naturally affects all the input ports adjacent to it. Furthermore, interference 
is modeled using a finite field additive MAC, which provides a set of possible binary codes at the input ports. 



in Figure |4] A supernode V G V is a node of the original 
network. We use the term supernode to emphasize the fact 
that supernode V consists of input ports I{V) and output ports 
0(V), as shown in Figure [5] Let S,T C V be the set of source 
and destination supernodes. An edge (ei,e 2 ) may exist from 
an output port e\ G 0(Vi) to an input port e 2 € I{V 2 ), for 
any V\ , V 2 G V. Let £(Vi , V 2 ) be the set of edges from 0(Vi) 
to /(V2). All edges are of unit capacity, where capacity is 
normalized with respect to the symbol size of ¥ q . 

Noise is embedded, or hard-coded, in the structure of 
the ADT network in the following way. Parallel links of 
£{Vi,V2) deterministic ally model noise between V\ and V 2 . 
Let SNR^y.^y.j be the signal-to-noise ratio from supernode 
Vi to supernode V r Then, \£(V U V 2 )\ = \\ log SNR {VuVj) \ . 
Thus, the number of edges between two supernodes Vi and Vj 
represents the channel quality (equivalently, the noise) between 
the two supernodes. 

Given such a wireless network G = (V, £), let S be the set 
of sources. A source supernode S G S has independent ran- 
dom processes X(S) = [X(S,l),X(S,2),...,X(S,p,(S))\, 
n(S) < |0(5)|, which it wishes to communicate to a set 
of destination supernodes T(S) C T. In other words, we 
want T G T(S) to replicate a subset of the random processes, 
denoted X(S, T) C X(S), by the means of the network. Note 
that the algebraic formulation is not restricted to multicast 
connections; different sources may wish to communicate to 
different subsets of destinations. We define a connection c 
as a triple (S,T, X(S,T)), and the rate of c is defined as 

R(c) = Ex { s^x(s,T) H ( x (S^)) = \X(S,T)\ (symbols). 

Information is transmitted through the network in the fol- 
lowing manner. A supernode V sends information through 
e G 0(V) at a rate at most one symbol per time unit. Let 
Y(e) denote the random process at port e. In general, Y(e), 
e G 0(V), is a function of Y(e'), e' G /(V). In this paper, 
we consider only linear functions. 

Y(e) = ]T P(e',e)Y(e'), for e G 0(V). (1) 

e'eI(V) 

For a source supernode S, and e G O(S), 
Y ( e )= E %,e)Y(e')+ a ii>e) X(S,i). (2) 

e'e-T(V) X(S,i)£X(S) 

Finally, the destination T receives a collection of input pro- 
cesses Y(e'), e' G I{T). Supernode T generates a set of 
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Fig. 8: An example of finite field additive MAC. 

random processes Z{T) = [Z(T, 1), Z(T, 2), Z(T, v(T))] 
where 

Z(T,i)= He>,(T,i))Y(e'). (3) 

e>eI(T) 

A connection c = (S, T, X(S, T)) is established successfully 
if X(S) = Z(T). A supernode V is said to broadcast to a 
set V C V if £(V, V) ^ for all V G V. In Figure gj 5 
broadcasts to supernodes V\ and V 2 . Superposition occurs at 
the input port e' G /(F), i.e. Y(e') = Y,( e ,e')&£ Y ( e ) over 
a finite field F g . We say there is a |V'|-user MAC channel if 
E{V, V)^% for all V G V. In FigureH supernodes V 1 and 
V 2 are users, and T the receiver in a 2-user MAC. 

For a given network G and a set of connections C, we say 
that (G, C) is solvable if it is possible to establish successfully 
all connections c G C. The broadcast and MAC constraints 
are given by the network; however, we are free to choose the 
variables au >e -\, Pi e i >e ), and £(<>',,) from ¥ q . Thus, the problem 
of checking whether a given (G, C) is solvable is equivalent to 
finding a feasible assignment to au e \, /3( e / j6 ), and e( e ',(T,i)V 

Example 3.1: The equations in Figure [6] relate the various 
processes in the example network in Figure Note that in 
Figure |U we have set Y(e\) = a, Y(e 2 ) = b, Y(er) = c, 
Y(eg) = d, and F(eio) = / for notational simplicity. 

A. An Interpretation of the Network Model 

The ADT network model uses multiple channels from 
an output port to model broadcast. In Figure there are 
two edges from output port e\ to input ports and e§; 
however, due to the broadcast constraint, the two edges (e\, e^) 
and (ei,ee) carry the same information a. This introduces 
considerable complexity in constructing a network code as 
well as computing the min-cut of the network IfTI Il2l |[T2l Ifl4l . 
This is because multiple edges from a port do not capture the 
broadcast dependencies. Furthermore, the broadcast dependen- 
cies have to be propagated through the network. 
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In our approach, we remedy this by introducing the use of 
hyperedges, as shown in Figure Q and Section |IV] An output 
port's decision to transmit affects the entire hyperedge; thus, 
the output port transmits to all the input ports connected to 
the hyperedge simultaneously. This removes the difficulties of 
computing the min-cut of ADT networks (Section |V), as it 
naturally captures the broadcast dependencies. 

The finite field additive MAC model can be viewed as a set 
of codes that an input port may receive. As shown in Figure 
|2l input port ei2 receives one of the four possible codes. The 
code that ei2 receives depends on output ports ej's and eio's 
decision to transmit or not. 

The difficulty in constructing a network code does not come 
from any single broadcast or MAC constraint. The difficulty 
in constructing a code is in satisfying multiple MAC and 
broadcast constraints simultaneously. For example, in Figure 
[8j the fact that may receive a + b does not constrain the 
choice of a nor b. This is because we can choose any a and b 
such that a + b ^ 0, and ensure that both a and b are decoded 
as long as enough degrees of freedom are received by the 
destination node. The same argument applies to e$ receiving 
a + c. However, the problem arises from the fact that a choice 
of value for a at e± interacts both with b and c. In such a case, 
we need to ensure that both a + b ^ and a + c ^ 0; thus, 
our constraint is (a + b)(a + c) 7^ 0. As the network grows in 
size, we will need to satisfy more constraints simultaneously. 
As we shall see in Section [VJ we eliminate this difficulty by 
allowing the use of a larger field, ¥ q . 

IV. Algebraic Network Coding Formulation 

We provide an algebraic formulation for the ADT network 
problem (G,C), and present an algebraic condition under 
which the system (G, C) is solvable. We assume that G is 
acyclic in this section; however, we shall extend the results in 
this section to ADT networks with cycles in Section [IX] For 
simplicity, we describe the multicast problem with a single 
source S and a set of destination supernodes T, as in Figure [9] 
However, this formulation can be extended to multiple source 
51,5*2, ---Sk by adding a super-source S as in Figure [10] 

We define a system matrix M to describe the relationship 
between the source's random processes X(S) and the destina- 
tions' processes Z = [Z(Ti), Z(T 2 ), Z(T\ T \)]. Thus, we 
want to characterize M where 

Z = X(S)- M. (4) 

The matrix M is composed of three matrices, A, F, and B. 

A. Adjacency matrix F 

Given G, we define the adjacency matrix F as follows: 

!1 if (e it ej) G £, 

/3 (ei , ei) if a G I(V), ej G 0{V) for V G V, (5) 
otherwise. 

Matrix F is defined on the ports, rather than on the supern- 
odes. This is because, in the ADT model, each port is the basic 
receiver/transmitter unit. Each entry F{ j represents the input- 
output relationships of the ports. A zero entry indicates that 



l(Tj) 




Fig. 9: Single multicast network with source S and receivers 
T\, Tn. 




I<Sk)0(S k ) 



Fig. 10: A network with multiple sources Si, S2, Sk can 
be converted to a single source problem by adding a super- 
source S with 10(5)1 = Y%Li \0(Si)\. Each e' 3 G 0{S) has 
a "one-to-one connection" to a ej £ O(Si), for i G 
Matrix A; represents the encoding matrix for source Si, while 
Bj is the decoding matrix at destination Tj. The white area 
represents the zero elements, and the shaded area represents 
the coding coefficients. 



the ports are not directly connected, while an entry of one 
represents that they are connected. The adjacency matrix F 
naturally captures the physical structure of the ADT network. 
Note that a row with multiple entries of 1 represents the 
broadcast hyperedge; while a column with multiple entries of 
1 represents the MAC constraint. Note that the 0-1 entries 
of F represent the fixed network topology as well as the 
broadcast and MAC constraints. On the other hand, /3( ei , e ) 
are free variables, representing the coding coefficients used at 
V to map the input port processes to the output port processes. 
This is the key difference between the work presented here and 
in ID - F is partially fixed in the ADT network model due 
to network topology and broadcast/MAC constraints, while in 
flU, only the network topology affects F. 

In UllEl, the supernodes are allowed to perform any internal 
operations; while in lfl~2l lfl~4l . only permutation matrices {i.e. 
routing) are allowed. Note that fPH lfl"4l only consider a single 
unicast traffic, in which routing is known to be sufficient. 
References flT] fl2) showed that linear operations are sufficient 
to achieve the capacity in ADT networks for a single multicast 
traffic. We consider a general setup in which /3( e . 6j ) G ¥ q - 
thus, allowing any matrix operation, as in (EDI!). 

Note that F k , the fc-th power of an adjacency matrix of a 
graph G, shows the existence of paths of length k between any 
two nodes in G. Therefore, the series I + F + F 2 + F 3 + ... 
represents the connectivity of the network. It can be verified 
that F is nilpotent, which means that there exists a k such that 
F k is a zero matrix. As a result, I + F + F 2 + F 3 + ... can be 
written as (I — F)^ 1 . Thus, (7 — F) _1 represents the impulse 
response of the network. Note that, (I — F)~ 1 exists for all 



7 









1 








i 


























{) 


1 


n 









































^( e 3 > e 7) 





























n 





Br \ 

















o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 


o 
















































n 




















1 



























































(1 








1 

















n 




















1 














n 




























































System matrix M = A (I-F)' 1 B T 
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Fig. 11: 12 x 12 adjacency matrix F for network in Figure 2] 



acyclic network since I — F is an upper-triangle matrix with 
all diagonal entries equal to 1; thus, det(J — F) = 1. 

Example 4.1: In Figure QT| we provide the 12 x 12 adja- 
cency matrix F for the example network in Figures |4] and 
Q Note that the first row (with two entries of 1) represents 
the broadcast hyperedge, e\ connected to both and e§. The 
last column with two entries equal to 1 represents the MAC 
constraint, both ej and eio transmitting to ei2- The highlighted 
elements in F represent the coding variables, /3( e < je ), of V\ and 
Vi. For some (e',e), /3( e ',e) = since these ports of Vi and 
V-2 are not used. 

B. Encoding matrix A 

Matrix A represents the encoding operations performed at 
S. We define a 1^(5)1 x \£\ encoding matrix A as follows: 

if e 3 e O(S) and X(S, i) G X(S) 
otherwise. 



.4 



'■j 




(6) 



Example 4.2: We provide the 2 x 12 encoding matrix A for 
the network in Figure |4] 



A = 



a l,ei 
a 2,a 



Oil,, 
012, 



C. Decoding matrix B 

Matrix B represents the decoding operations performed at 
the destination T G T. Since there are |T| destination nodes, 
B is a matrix of size \Z\ x \£\ where Z is the set of random 
processes derived at the destination supernodes. We define the 
decoding matrix B as follows: 



Bi 



_ ) e (e i ,(T j ,k)) 





if e, G I{Tj), Z(Tj, k) eZ(Tj), 
otherwise. 

(7) 

Example 4.3: We provide the 2 x 12 decoding matrix B for 
the example network in Figure |4] 

'0 • • • 
,0 ••• 



B = 



e (eil,(T,l)) 
e (eil,(7\2)) 



e (ei2,(7\l)) 
6 (ei2,(T,2)) 



D. System matrix M 

Theorem 4.1: Given a network G = (V,£), let A, B, 
and F be the encoding, decoding, and adjacency matrices, 
respectively. Then, the system matrix M is given by 

M = A(l - F)- l B T . (8) 



X(S)\ 

v 




A 












< > 

\0(S) 






(I-F) 1 








< 


£\ 











\X(S)\ 



A 








V 


M, 




M N 



\£\ 



< — > < — > 

|Z(7i)| \Z(T N )\ 
c \Z\ ' 



Fig. 12: The system matrix M and its components A, (I — 
F)~ l , and B for a single multicast connection with source S 
and destinations Ti, i G [l,iV]. 



Proof: The proof of this theorem is similar to that of 
Theorem 3 in J4). As previously mentioned, (I — F)^ 1 = 
(I + F + F 2 + ...) always exists for an acyclic network G. ■ 

Note that the algebraic framework shows a clear separation 
between the given physical constraints (fixed 0-1 entries of F 
showing the topology and the broadcast/MAC constraints), and 
the coding decisions. As mentioned previously, we can freely 
choose the coding variables a^ ej) , e (e . i(Ti)ft )), and P( eue .y 
Thus, solvability of (G, C) is equivalent to assigning values to 
a (i,ej), e (ei.(Tj.fc))! an d fi{ei,ej) sucn that each receiver T G T 
is able to decode the data it is intended to receive. 

Example 4.4: We can combine the matrices F, A, and B 
from Examples 14.11 14.21 and 14.31 respectively to obtain the 
system matrix M = A(I — F)~ 1 B T for the network in Figure 
H] We show a schematic of the system matrix M in Figure FPU 



V. Definition of Min-cut 

Consider a source S and a destination T. Reference Jl] 
proves the maximal achievable rate to be the minimum value 
of all S-T cuts, denoted mincut(S, T), which we reproduce 
below in Definition 15. II 

Definition 5.1 (Min-cut [l][2|j: A cut SI between a source 
S and a destination T is a partition of the supernodes into two 
disjoint sets SI and Sl c such that S G SI and T G Sl c . For any 
cut, Gq is the incidence matrix associated with the bipartite 
graph with ports in SI and Sl c . Then, the capacity of the given 
ADT network (equivalently, mincut(S, T)) is defined as 



mincut(S,T) = min rank(Gn). 



This rate of mincut(S, T) can be achieved using linear 
operations for a single unicast/multicast connection. ■ 

Note that, with the above definition, in order to compute 
mincut(S, T), we need to optimize over all cuts between S 
and T. In addition, the proof of achievability in jTJ is not 
constructive, as it assumes infinite block length and does not 
consider the details of internal supernode operations. 

We introduce a new algebraic definition of the min-cut, and 
show that it is equivalent to that of Definition 15.11 
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Theorem 5.1: The capacity of the given ADT, equivalently 
the minimum value of all S — T cuts mincut(S, T), is 

mincut(S,T) = minrank(Gfi) 

= max rank(M). 

<*(i,«)Ae',e).«(«',i) 

Proof: By (TJ, we know that mincut(S,T) = 
minn rank(Go). Therefore, we show that max Qj( g. c rank(M) is 
equivalent to the maximal achievable rate in an ADT network. 

First, we show that mincut(S,T) > max a p e rank(M). In 
our algebraic formulation, Z(T) = X(S)M ; thus, the rank of 
M represents the rate achieved. Let R = max 0j 8 je rank(Af ). 
Then, there exists an assignment of otu e \, /3( e ', e ), an d £( e '.i) 
such that the network achieves a rate of R. By the definition 
of min-cut, it must be the case that mincut(S, T) > R. 

Second, we show that mincut(S,T) < max Qi/ 3 :£ rank(M). 
Assume that R = mincut(S,T). Then, by QQ, there 
exists a linear configuration of the network such that we can 
achieve a rate of R such that the destination T is able to 
reproduce X(S,T). This configuration of the network pro- 
vides a linear relationship of the source-destination processes 
(actually, the resulting system matrix is an identity matrix); 
thus, an assignment of the variables a.u ie ),l3( e > t< A, and et e i^\ 
for our algebraic framework. We denote M' to be the system 
matrix corresponding to this assignment. Note that, by the 
definition, M' is an R x R matrix with a rank of R. Therefore, 
max Q , i/ g i£ rank(M) > rank(M') = mincut(S,T), ■ 

The system matrix M (thus, the network and decodability 
at the destinations) depends not only on the structure of the 
ADT network, but also on the field size used, supernodes' 
internal operations, transmission rate, and connectivity. For 
example, the network topology may change with a choice 
of larger field size, since larger field sizes result in fewer 
parallel edges/channels. Another example, if we adjust the 
rate such that \X(S)\ < mincut(S,T), then M is full rank. 
However, if ^(S 1 )! > mincut(S,T), then M may have rank 
of mincut(S, T) but not be full-rank. It is important to note 
that the cut value in the ADT network may not equal to the 
graph theoretical cut value (see Figure 2 in lTT2l ). 

VI. Min-cut Max-flow Theorem 

In this section, we provide an algebraic interpretation of the 
Min-cut Max-flow theorem for a single unicast connection and 
a single multicast connection (HE) - This result is a direct con- 
sequence of B when applied to the algebraic formulation for 
the ADT network. We also show that a distributed randomized 
coding scheme achieves the capacity for these connections. 

Theorem 6.1 (Min-cut Max-flow Theorem): Given 
an acyclic network G with a single connection 
c = (S,T,X(S, T)) of rate R(c) = \X(S,T)\, the following 
are equivalent: 

1) A unicast connection c is feasible. 

2) mincut(S,T) > R(c). 

3) There exists an assignment of au e .}, e( ei ,(T-,&))' ar *d 
/?( ei)e .) such that the R(c) x R(c) system matrix M is 
invertible in ¥ q (i.e. det(M) ^ 0). 



Proof: Statements 1) and 2) have been shown to be 
equivalent in ADT networks JT] lfl2l lfT4l . We now show that 1 ) 
and 3) are equivalent. Assume that there exists an assignment 
such that det(M) ^ in ¥ q . Then, the system matrix M is 
invertible; thus, there exists M _1 such that X(S) = ZM~ Y , 
and a connection of rate R(c) — \X(S, T)\ is established. 
Conversely, if connection c is feasible, there exists a solution 
to the ADT network G that achieves a rate of R(c). When 
using this ADT network solution, the destination T is able 
to reproduce X(S,T); thus the resulting system matrix is an 
identity matrix, M = I. Therefore, M is invertible. ■ 
Corollary 6.1 (Random Coding for Unicast): Consider an 
ADT network problem with a single connection c = 
{S,T,X(S,T)) of rate R(c) = \X(S, T) < mincut(S,T). 
Then, random linear network coding, where some or all code 
variables e( ei ,(T-,k))> an d 0(e it e-) aie chosen inde- 

pendently and uniformly over all elements of ¥ q , guarantees 
decodability at destination T with high probability at least 
(1 - i)* 7 , where r\ is the number of links carrying random 
combinations of the source processes. 

Proof: From Theorem 16.11 there exists an assignment of 

«(j, ej ). e( ei ,(T,,fc))' and P(e t ,ej) such mat det(M) ^ 0, which 
gives a capacity-achieving network code for the given (G,C). 
Thus, this connection c is feasible for the given network. 
Reference ifTTl proves that random linear network coding 
is capacity-achieving and guarantees decodability with high 
probability (1 — |)'' for such a feasible unicast connection c. 

■ 

Theorem 6.2 (Single Multicast Theorem): Given an acyclic 
network G and connections C = {(S,T U X(S)), (S,T 2 , 
X(S)), ...,(S,T N ,X(S))}, (G,C) is solvable if and only if 
mincut(S,T t ) > \X(S)\ for all i. 

Proof: If (G, C) is solvable, then mincut(S,Ti) > 
1^(5)1. Therefore, we only have to show the converse. As- 
sume mincut(S,Ti) > \X(S)\ for all i 6 [1,-AT]- The system 
matrix M = {Mi} is a concatenation of \X(S)\ x \X(S)\ 
matrices where Z(Ti) = X(S)Mi, as shown in Figure [T5] 
We can write M = [M 1 ,M 2 ,...,M N ] = A(I - F)~ 1 B T = 
A(I-F)-~ L {B 1 ,B 2 , B N }. Thus, M. t = A(I-F)- 1 B. l . Note 
that A and B^'s do not substantially contribute to the system 
matrix Mj since A and Bi only perform linear encoding and 
decoding at the source and destinations, respectively. 

By Theorem 16.11 there exists an assignment of au e .y 
e(ei,(r-,fc))> an d /3( e<J e ) sucn that each individual system 
submatrix Mj is invertible, i.e. det (Mi) ^ 0. However, an as- 
signment that makes det (Mi) ^ may lead to det (Mj) = 
for i / j. Thus, we need to show that it is possible to 
achieve simultaneously det (Mi) ^ for all i (equivalently 
FJi det (Mi) ^ 0). By UJJ, we know that if the field size 
is larger than the number of receivers (q > N), then there 
exists an assignment of au e .\, et ei ,(T-,k))> an d /3(e 4 ,e ) suc h 
that det (Mi) ^ for all i. ' 3 ■ 

Corollary 6.2 (Random Coding for Multicast): Consider 
an ADT network problem with a single multicast connection 
C = {(S,T 1 ,X(S)),(S,T 2 ,X(S)),...,(S,T N ,X(S))} with 
mincut(S,Ti) > 1^(5)1 for all i. Then, random linear 
network coding, where some or all code variables Q!(j ie .), 
e(e i ,(r-,fc))> ar, d /?(ei,e) ^ chosen independently and 
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uniformly over all elements of F g , guarantees decodability at 
destination Tj for all i simultaneously with high probability 
at least (1 — y) 17 , where 77 is the number of links carrying 
random combinations of the source processes; thus, <q < \£\. 

Proof: Given that the multicast connection is feasible 
(which is true by Theorem l6.2t . reference ifPTl shows that ran- 
dom linear network coding achieves the capacity for multicast 
connections, and allows all destination supernodes to decode 
the source processes X(S) with high probability of at least 
(1 - f )". ■ 
Theorem 16.11 and Theorem 16.21 provide an alternate proof 
of sufficiency of linear operations for unicast and multicast in 
ADT networks, which was first shown in 0]. 

VII. Extensions to other connections 

In this section, we extend the ADT network results to a 
more general set of traffic requirements. We use the algebraic 
formulation and the results from H to characterize the feasi- 
bility conditions for a given problem (G,C). 

A. Multiple Multicast 

Theorem 7.1 (Multiple Multicast Theorem): Given a net- 
work G and a set of connections C = {(Si,Tj, X(Si)) \ Si 6 
S, Tj £ T}, (G, C) is solvable if and only if Min-cut Max- 
flow bound is satisfied for any cut that separates the source 
supernodes S and a destination Tj, for all Tj £ T. 

Proof: We first introduce a super-source S with = 
Ss-eS 1^(^)1' anc l connect each e'j £ O(S) to an input of 
Si such that ej £ O(Si) as shown in Figure \W\ Then, we 
apply Theorem 16.21 which proves the statement. ■ 

Corollary 7.1 (Random Coding for Multiple Multicast): 
Consider an ADT network problem with multiple multicast 
connections C = {(Si,Tj, X(Si))\Si £ S,Tj e T} with 
mincut(S,Tj) > J2i\X(Si)\ for all i. Then, random 
linear network coding, where some or all code variables 
a (i.ej), e(e j ,(T-,fc)) ! and /3( ei>e .) are chosen independently and 
uniformly over all elements of ¥ q , guarantees decodability at 
destination Tj for all i simultaneously with high probability 
at least (1 - ^) r ', where rj is the number of links carrying 
random combinations of the source processes; thus, rj < \£\. 

The optimality of random coding in Corollary 17.11 comes 
from the fact that we allow coding across multicast connec- 
tions (Si,Tj,X(Si))'s - i.e. , the source supernodes and the 
intermediate supernodes can randomly and uniformly select 
the coding coefficients. Thus, intermediate nodes within the 
network do not distinguish the flow from source Sj from that 
of Sj, and are allowed to encode them together randomly. 

B. Disjoint Multicast 

Theorem 7.2 (Disjoint Multicast Theorem): Given an 
acyclic network G with a set of connections C = {(S,Ti, 
X{S,Ti)) I i = 1,2, K} is called a disjoint multicast 
if X(S,Ti) n X(S,Tj) = for all i ^ j. Then, (G,C) is 
solvable if and only if the min-cut between S and any subset 
of destinations T' C T is at least 2~2r eT' \X(S>Ti)\, *•<?• 
mincut(S,V) > Er«eT' \ x ( s > T i)\ for any V c T. 



KT,) 0(T,) 




I(T) System matrix 



M 





\B/\ 




(I-F)i 







M, 



I(T K )0(T K ) 



Fig. 13: Disjoint multicast problem can be converted into a 
single destination problem by adding a super-destination T. 
The system matrix M for the disjoint multicast problem is 
shown as well. Note that unlike the multicast problem in 
Figure [9] where M = [Mi, M2, Mjf], the system matrix 
M is a diagonal concatenation of Mi's. 



Proof: Create a super-destination supernode T with 
\I(T)\ = Y,f=i \I{Ti)\, and an edge (e, e') from e £ 0{T), 
i £ [1,-K] to e! £ I(T), as in Figure [T3] This converts the 
problem of disjoint multicast to a single-source S, single- 
destination T problem with rate X(S,T) = J2 T , eT \X{S,T)\. 
The mincut(S,T) > \X(S, T)\; so, Theorem I67T1 applies. 
Thus, it is possible to achieve a communication of rate 
X(S, T) between S and T. Now, we have to guarantee that 
the receiver T, is able to receive the exact subset of processes 
X(S,Ti). Since the system matrix to T is full rank, it is 
possible to carefully choose the encoding matrix A such that 
the system matrix M at super-destination supernode T is 
an identity matrix. This implies that for each edge from the 
output ports of Ti (for all i) to input ports of T is carrying 
a distinct symbol, disjoint from all the other symbols carried 
by those edges from output ports of Tj, for all i ^ j. Thus, 
by appropriately permuting the symbols at the source, S can 
deliver the desired processes to the intended Tj as shown in 
Figure Qj] ■ 

Random linear network coding with a minor modification 
achieves the capacity for disjoint multicast. We note that only 
the source's encoding matrix A needs to be modified. The 
intermediate supernodes can randomly and uniformly select 
coding coefficients e( ei ,(T-,fc)) an d /3( ei ,e ) over ai l elements 
of F g . Once these coding coefficients at the intermediate 
supernodes are selected, S carefully chooses the encoding 
matrix A such that the system matrix corresponding to the 
receivers of the disjoint multicast is an identity matrix, as 
shown in FigureQj] To be more precise, when e( ei ,(T ,fe)) an d 
(3( e . _ e ) are randomly selected over elements of ¥ q , with high 
probability, (7 — F)~ 1 B T is full rank. Thus, there exists a 
matrix A such that A(I — F)~ 1 B T is an identity matrix I. 
Note that A(I — F)~ 1 B T does not need to be an identity 
matrix - it only needs to have a diagonal structure as shown 
in Figure QjJ however, being an identity matrix is sufficient 
for proof of optimality. 

We note another subtlety here. Theorem 17 .2 1 holds precisely 
because we allow the intermediate nodes to code across all 
source processes, even they are destined for different receivers. 
This takes advantage of the fact that the single source can 
cleverly pre-code the data. 
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Fig. 14: The system matrix M for the two-level multicast prob- 
lem. The structure of the system matrix M is a "concatenation" 
of the disjoint multicast problem (Figure Q~3j and the single 
multicast problem (Figure |9j. 



C. Two-level Multicast 

Theorem 7.3 (Two-level Multicast Theorem): Given an 
acyclic network G with a set of connections C = Cd U C m 
where C d = {(S,T i ,X(S,T i ))\X(S,T i ) n X(S,Tj) = 0, 
i h hi S [1,/v]} is a disjoint multicast connection, and 
C m = {(S,T h X(S)) | i e [K + 1,N}} is a single source 
multicast connection. Then, (G,C) is solvable if and only if 
the min-cut between S and any T' Q {T\, ...,Tr-} is at least 
J2t gT' ^(^^i)!' an d me min-cut between S and Tj is at 
least \X(S)\ for j e [K+1,N]. 

Proof: We create a super-destination T for the disjoint 
multicast destinations as in the proof for Theorem 17.21 Then, 
we have a single multicast problem with receivers T and 
{T % \i G [K + l,N]}. Therefore, Theorem lo\2l applies. By 
choosing the appropriate matrix A, S can satisfy both the 
disjoint multicast and the single multicast requirements, as 
shown in Figure [14] ■ 

As in the disjoint multicast case, random linear network 
coding with a minor modification at the source achieves the 
capacity for two-level multicast. Note that, receivers Ti, i G 
[K + 1 , N ] are of no concern - the source S can randomly 
choose coding coefficients otu t e ) to achieve a full-rank system 
matrix M*. Thus, S needs to carefully choose the encoding 
matrix A to satisfy the disjoint multicast constraint, which 
can be done as shown in Section IVII-BI 

Theorem 17.31 does not extend to a three-level multicast. 
Three-level multicast, in its simplest form, consists of con- 
nections {(S,Ti,X(S,Ti))\ i e [1,3]} where X(S,T 1 ) c 
X(S,T 2 ) c X(S,T 3 ). 

D. General Connection Set 

In the theorem below, we present sufficient conditions for 
solvability of a general connection set. This theorem does not 
provide necessary conditions, as shown in ||29l . 

Theorem 7.4 (Generalized Min-cut Max-flow Theorem): 
Given an acyclic network G with a connection set C, let 
M = {Mij} where M$ j is the system matrix for source 
processes X(Si) to destination processes Z(Tj). Then, (G,C) 
is solvable if there exists an assignment of a^ e .\, £( ei ,(Tj,fc))' 



and /3( e . 



such that 



1) M id = for all (Si, T h X(S i ,T j )) $ C, 

2) Let (S^ {) ,T j ,X(S a{{) ,T j )) e C for i e [l,K(j)]. 
Thus, this is the set of connections with Tj as a receiver. 
Then, [Mj (1)j ,Mj (2)j ,..., M% Kj) J is a \Z(Tj))\ x 
|Z(Tj)| is a nonsingular system matrix. 



Proof: Note that [Mj (1)> ., Mj (2) j , } is a 
system matrix for source processes X(S a (f\), i £ [l,K(j)] t 
to destination processes Z(Tj). 

Condition 2) states the Min-cut Max-flow condition; thus, 
is necessary to establish the connections. Condition 1) states 
that the destination supernode Tj should be able to distinguish 
the information it is intended to receive from the information 
that may have been mixed into the flow it receives. These two 
conditions are sufficient to establish all connections in C. The 
proof is similar to that of Theorem 6 in J4] . ■ 

VIII. Network with Random Erasures 

We consider the algebraic ADT problem where links may 
fail randomly, and cause erasures. Wireless networks are 
stochastic in nature, and random erasures occur dynami- 
cally over time. However, the original ADT network models 
noise deterministically with parallel noise-free bit-pipes. As 
a result, the min-cut (Definition 15. It and the network code 
|[T2llfT4ll28l . which depend on the hard-coded representation 
of noise, have to be recomputed every time the network 
changes. 

We show that the algebraic framework for the ADT network 
is robust against random erasures and failures. First, we 
show that for some set of link failures, the network code 
remains successful. This translates to whether the system 
matrix M preserves its full rank even after a subset of variables 
a (i.ej)i £ (e i ,(D j .k))> an d P(ei,e ) associated with the failed links 
is set to zero. Second, we show that the specific instance 
of the system matrix M and its rank are not as important 
as the average rank(A/) when computing the time average 
min-cut. Note that the original min-cut definition (Definition 
15. jj requires an optimization over an exponential number of 
cuts for every time step to find the average min-cut. We shall 
use the results from ll30l to show that random linear network 
coding achieves the time-average min-cut; thus, is capacity- 
achieving. 

We assume that any link within the network may fail. Given 
an ADT network G and a set of link failures f,Gf represents 
the network G experiencing failures /. This can be achieved 
by deleting the failing links from G, which is equivalent to 
setting the coding variables in B(f) to zero, where B(f) is 
the set of coding variables associated with the failing links. 
We denote M be the system matrix for network G. Let Mf 
be the system matrix for the network Gf. We do not assume 
that the link failures are static; thus, we can consider a static 
link failure patterns, a distribution over link failures patterns, 
or a sequence fi,f2,fs--- of link failures. 

A. Robust against Random Erasures 

Given an ADT network problem (G, C), let T be the set of 
all link failures such that, for any / G T, the problem (Gf, C) 
is solvable. The solvability of a given (Gf,C) can be verified 
using resulting in Sections [VI] and IVIII We are interested in 
static solutions, where the network is oblivious of /. In other 
words, we are interested in finding the set of link failures such 
that the network code is still successful in delivering the source 
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processes to the destinations. For a multicast connection, we 
show the following surprising result. 

Theorem 8.1 (Static Solution for Random Erasures): 
Given an ADT network problem (G, C) with 
a multicast connection C = {(S, Ti, X(S)), 
(S,T 2 ,X(S)),...,(S,T N ,X(S))}, there exists a static 
solution to the problem (Gf,C) for all / G T. In other words, 
there exists a fixed network code that achieves the multicast 
rate despite any failures / G T. 

Proof: By Theorem 16.21 we know that for any given / G 
JF, the problem (Gf,C) is solvable; thus, there exists a code 
det (Mf) 0. Now, we need to show that there exists a code 
such that dot (A//) ^ for all / <E J- simultaneously. This 
is equivalent to finding a non-zero solution to the following 
polynomial: 11/6^- ^et (Mf) ^ 0. Reference ifTTl showed that 
if the field size is large enough (q > |.F||T| = \J-\N), then 
there exists an assignment of a^ ej ), e( ei ,(D j ,k))> and P( ei , ej ) 
such that det (Mf) ^ for all / G T. ■ 

Corollary 8.1 (Random Coding against Random Erasures): 
Consider an ADT network problem with a multicast connec- 
tion C = {(S,Tx,X(S)), (S,T 2 ,X(S)),...,(S,T N ,X(S))}, 
which is solvable under link failures /, for all / G T . 
Then, random linear network coding, where some or all 



code variables a 



(i.e^eCen.CD^fe))' 



and /3 (e 



are chosen 



independently and uniformly over all elements of F 9 
guarantees decodability at destination supernodes Tj for 
all i simultaneously and remains successful regardless of 
the failure pattern / 6 T with high probability at least 



(1 



JVI.FI 



'', where r) is the number of links carrying random 



combinations of the source processes. 

Proof: Given a multicast connection that is feasible under 
any link failures / G F, fTTIl showed that random linear 
network coding achieves the capacity for multicast connec- 
tions, and is robust against any link failures / G J- with high 
probability (1 - ^)". ■ 
We note that it is unclear whether this can be extended to 
the non-multicast connections, as noted in (4)- Reference (4) 
shows a simple example network in which no static solution 
is available for a set of feasible failure patterns. 



B. Time-average Min-cut 

In this section, we study the time-average behavior of the 
ADT network, given random erasures. We use techniques 
from ll30l . which studies reliable communication over lossy 
networks with network coding. 

Consider an ADT network G. In order to study the time- 
average steady state behavior, we introduce erasure distribu- 
tions. Let T 1 be a set of link failure patterns in G. A set of 
link failures / G J- 1 may occur with probability pf. 

Theorem 8.2 (Min-cut for Time-varying Network): 
Assume an ADT network G in which link failure pattern 
/ G T' occurs with probability Pf. Then, the average min-cut 
between two supernodes S and T in G, mincutjri(S,T) is 



mincutjri (S, T) 



max 



mk(M f ) 



Proof: By Theorem 15. II we know that at any given time 
instance with failure pattern /, the min-cut between S and T 
is given by max Q( . e))( g., e) ,e (e , ;) rank(M^). Then, the above 
statement follows naturally by taking a time average of the 
min-cut values between S and T. ■ 

The key difference between Theorems 18.11 and 18 . 2 1 is that in 
Theorem 18.11 any / G T may change the network topology 
as well as min-cut but mincut(S,T) > \X(S)\ holds for all 
/ G T - i.e. (Gf,C) is assumed to be solvable. However, in 
Theorem 18.21 we make no assumption about the connection 
as we are evaluating the average value of the min-cut. 

Unlike the case of static ADT networks, with random 
erasures, it is necessary to maintain a queue at each supernode 
in the ADT network. This is because, if a link fails when a 
supernode has data to transmit on it, then it will have to wait 
until the link recovers. In addition, a transmitting supernode 
needs to be able to learn whether a packet has been received 
by the next hop supernode, and whether it was innovative 

- this can be achieved using channel estimation, feedback 
and/or redundancy. In the original ADT network, the issue of 
feedback was removed by assuming that the links are noiseless 
bit-pipes. We present the following corollaries under these 
assumptions. 

Corollary 8.2 (Multicast in Time-varying Network): 
Consider an ADT network G and a multicast connection 
C = {(S,T 1 ,X(S)),...,(S,T N ,X(S))}. Assume that failures 
occur where failure patten / G T' occurs with probability 
Pf. Then, the multicast connection is feasible if and only if 
mincutjr,(S,T t ) > \X(S)\ for all i. 

Proof: Reference 11301 shows that the multicast connection 
is feasible if and only mincut^>(S, Ti) > \X(S)\ for all i. The 
proof in ll30l relies on the fact that every supernode behaves 
like a stable M/M/l queuing system in steady-state, and thus, 
the queues (or the number of innovative packets to be sent to 
the next hop supernode) has a finite mean if the network is 
run for sufficiently long period of time. ■ 

Corollary 8.3 (Random Coding for Time-varying Network): 
Consider (G,C) where C is a multicast connection. Assume 
failure pattern / G T' occurs with probability pf. Then, 
random linear coding, where some or all code variables 
a(»,e J -))/3(e<,e J -))e(e i ,(Di,fc)) are chosen over all elements of 
W q guarantees decodability at destination nodes Tj for all i 
simultaneously with arbitrary small error probability. 

Proof: This is a direct consequence of Corollary 18.21 and 
results in El ESI. ■ 

IX. Network with Cycles 

ADT networks are acyclic, with links directed from the 
source supernodes to the destination supernodes. However, 
wireless networks intrinsically have cycles as wireless links 
are bi-directional by nature. In this section, we extend the 
ADT network model to networks with cycles. In order to 
incorporate cycles, we need to introduce the notion of time 

- since, without the notion of time, the network with cycles 
may not be casual. To do so, we introduce delay on the links. 
As in Q, we model each link to have the same delay, and 
express the network random processes in the delay variable 
D. 
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Fig. 15: 12 x 12 matrix (/ — DF) 1 for network in Figure 2] The matrix F can be found in Figure fTTI 



We define X t (S,i) and Z t (T,j) to be the i-fh and j-th 
binary random process generated at source S and received 
at destination T at time t, for t = 1,2,.... We define 
Y t (e) to be the process on edge e at time t = 1,2,..., 
respectively. We express the source processes as a power series 
in D, X(S, D) = [X(S,1, D), X(S, 2, D), X(S, p,(S), D)} 
where X(S,i,D) = YltLo Xt(S, i)-D*. Similarly, we express 
the destination random processes Z(T,D) = [Z(T, 1,D), 
...,Z(T,v{Z),D)] where Z(T,i,D) = J2Zo Z t {T,i)D l . In 
addition, we express the edge random processes as Y t (e, D) = 
J2u=o Yt(e)D f - Then, we can rewrite Equations ([U and (O as 



Y t+1 (e)= P{e',e)Yt{e') 

e'El(V) 



^2 a (i,e)Xt(S,i). 
X t (S,i)£X(S) 



Furthermore, the output processes Z t (T, i) can be rewritten as 

Z t+l (T,i)= J2 e e',(T, 4 )y t (e')- 

e'GI(T) 

Using this formulation, we can extend the results from H to 
ADT networks with cycles. We show that a system matrix 
M(D) captures the input-output relationships of the ADT 
networks with delay and/or cycles. 

Theorem 9.1: Given a network G = (V,£), let A(D), 
B(D), and F be the encoding, decoding, and adjacency 
matrices, as defined here: 



.4 



B, 



1,3 



a {i , ej ){D) if ej e O(S) and X(S,i) & X(S), 







otherwise. 



e {ei , {Tj , k) )(D) if e t G I(Tj), Z{T h k) G Z{T,) 
otherwise. 



and F as in Equation ([5). The variables a^ ie .){D) and 
e (ei,(Tj,k))(D) can either be constants or rational functions in 
D. Then, the system matrix of the ADT network with delay 
(and thus, may include cycles) is given by 



M(D) = A(D) ■ (I - DF)- 1 ■ B{Df 



(9) 



Proof: The proof for this is similar to that of Theorem 



Similar to Section [IV] (/ — DF)~ l represents the impulse 
response of the network with delay. This is because the series 
/ + DF + D 2 F 2 + D 3 F 3 + ... represents the connectivity of 
the network while taking delay into account. For example, F k 
has a non-zero entry if there exists a path of length k between 
two port. Now, since we want to represent the time associated 



with traversing from port e, to e.j, we use D k F k , where D k 
signifies that the path is of length k. Thus, (/ — DF)- 1 = 
I + DF + D 2 F 2 + D 3 F 3 + ... is the impulse response of 
the network with delay. An example of (7 — DF)- 1 for the 
example network in Figure [7] is shown in Figure Q3] 

Using the system matrix M(D) from Theorem 19.11 we can 
extend Theorem 16.11 Theorem 16.21 Theorem 17.11 Theorem 
17.21 and Theorem 17.31 to ADT networks with cycles/delay. 
However, there is a minor technical change. We now operate 
in a different field - instead of having coding coefficients 
from the finite field ¥ q , the coding coefficients a^ i e .^(D) and 
£((ei,(Tj,k)))(D) ar e now from ¥ q (D), the field of rational 
functions of D. We shall not discuss the proofs in detail; 
however, this is a direct application of the results in |@]. 

X. Code Construction for Multicast Connection 

As presented in Sections IVIBXI random linear network 
coding, which is completely distributed, requires randomiza- 
tion. As a result, random linear network coding achieves the 
capacity for the ADT network model with high probability. 
Similarly, |[24l introduced a distributed binary-vector network 
code, called permute-and-add, in which each node randomly 
and uniformly selects a permutation matrix from all such 
matrices for its coding operation. In ll24l . they show that 
permute-and-add is still optimal for multicast connections - 
i.e. achieves the capacity with high probability. Thus, network 
codes in ¥ q , q > 2, can be converted to vector code in binary 
field, (¥2) log2 ^ q \ without loss in performance. As a result, 
permute-and-add can be applied to the ADT network model. 
However, it is important to note that a randomized, distributed 
approach does not guarantee decodability (with probability 1). 

In this section, we propose an efficient code construction 
for multicast connection in ADT network, which guarantees 
all destination supernodes to decode if the connection is 
feasible. Furthermore, we only require that there be some local 
coordination among neighboring supernodes (more precisely, 
among supernodes within a layer), as we shall discuss in 
Section IX-C2I 

Given an acyclic ADT network G = (V, £ ) and a mul- 
ticast connection C = {(S, T t , X{S))\Ti E T}, we con- 
sider the problem of efficiently finding an assignment for 
a (i,ej)i e (e i ,(Dj,fc))> ar, d P( ei} ej) sucn that the system matrix 
M is invertible in ¥ q . Note that we assume that (G,C) 
is solvable - i.e. mincut(S,Ti) > R for all i, where 
R = |<-t(S)| is the multicast rate. We shall consider R = 
minXiGT mincut{S, T{). 
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Layer 1+1 




Fig. 16: Supernodes V\ and V2. 



For the given multicast rate R, we define the set 



Layer 1 



Layer 2 



Layer 3 




Fig. 17: A set W of input/output ports (in black) where R = 
3. The supernode and the ports shaded in grey represent the 
virtual sink T(W). Note that W is not regular. 



V(R) '{Fe V\mincut(S, V) > R}. 



(10) 



A property of our code construction is that all supernodes in 
V(R) will be able to decode the data, including those that are 
not in the set of destination supernodes T. We also note that 
the code designer may be oblivious of the exact location of 
the nodes in T or V(R). 

We assume that each supernode V G V contains n input 
ports and n output ports, where 



-log max SNR/ V . V .\ 
2 B (y i ,v j )&e (Vz ' Vj) 



(11) 



Therefore, we assume that |/(V)| = |0(V)| = n. For ease of 
notation, we distinguish the input ports of supernode V by 



and the output ports by 



(12) 



(13) 



The network is assumed to have A layers, where all links 
are from layer I to layer I + 1 for I = 1, A — 1. The source 
is at layer 1. We assume that there are at most Ni ayer nodes 
at any layer I 6 [1,A]. Since the network is acyclic, we can 
arrange all the ports in a topological order. The input ports of a 
certain supernode always precede the output ports of the same 
supernode. In addition, we adopt the convention that ports of 
supernodes in layer I precede all the ports of supernodes in 
layer I + 1, for I = 1, • • • , A— 1. We make the assumption that 
within a single layer, the supernodes are ordered from top to 
bottom. Also, within each supernode, ports are arranged from 
top to bottom. 

We denote P{x() to be the set of output ports that have 
links incoming into the input port x^ of a supernode V in layer 
I. By assumption, < |P(cc^)| < Ni ayer - i.e. there are at 
most Ni ayer edges incoming to xj from output ports in layer 
I — 1. We denote C{yJ) to be the set of input ports that have 
links outgoing to the output port yj of a supernode V in layer 
I. Note that < \C(yJ)\ < N[ ayer since yj may be adjacent 
to input ports of different supernodes in layer I + 1, but may 
not be adjacent to more than one input port per supernode in 
layer Z + 1. 



A. Regular Sets and Virtual Sinks 

For the algorithm we describe in the following sections, we 
will use an invariant which will be maintained throughout the 
algorithm. Prior to defining this invariant, we need to introduce 
the concepts of regular sets and virtual sinks. 

Consider supernodes at a certain layer I. We consider a 
set W of R ports, where for any supernode V the set W 
contains a subset of V's output ports, or a subset of V's input 
ports, but not both, or no ports of V. The set W may contain 
ports of several supernodes. Now consider the following. If 
W contains output ports of supernode V then we connect 
each of the output ports to a "virtual sink" T(W), which is a 
supernode consisting of its own ports. If W contains p input 
ports of supernode V, then we disconnect all the input ports 
of supernode V that are not in W. We connect the p upper 
output ports of V to the virtual sink T{W). The order in which 
the output ports are connected to T(W) is not important. For 
consistency, however, we assume that the output ports of layer 
I that are connected to sink T(W), are connected to the input 
ports of T(W) from top to bottom, each output port to a 
different input port of T(W). See Figure [l7]for an illustration. 

Definition 10.1 (Regular set): The set W is said to be 
regular if mincut(S, T{W)) = R. 

In Figure [T7] we show an example of W where R = 3, and 
a virtual sink T(W). The set W is not regular since T(W) 
receives only rate of 2. 

The important property of regular sets, as we shall observe, 
is that there exists a network code such that the coding vectors 
of the regular sets are linearly independent. We shall exploit 
this property for our code construction. 

B. Overview of Coding Scheme 

We proceed through the ports in the topological order, and 
for each port we reach, we choose the coding coefficients, 
taken from ¥ q , where q is the field size to be determined. 
Each port has a coding vector associated with it, where coding 
vector represents the coding coefficients used to generate the 
linear combination. Denote the coding vectors of the input 
ports of supernode V by 



{ x l > ' ' ' ? x n }i 



(14) 



where e is the coding vector associated with the input 
port ■ Similarly, we denote the coding vectors of the output 
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ports of supernode V by 

y° (v) = {ylV--,yr}, (15) 

where yj G is the coding vector associated with the output 
port yj. The coding vector yj is 

n 
i=l 

where /3( x y y y\ G ¥ q are the supernode internal coding 
coefficients, as described in Section [IV] 

Although we introduce new notation for the coding vectors 
for convenience, these vectors can still be captured by the 
system matrix M = A(I — F)~ 1 B T . Consider an input port 
xf ■ Assume that x( is the pth port in the topological order. 
Then, the coding vector ~x\ at this input port is pth column of 
the matrix A(I — F)^ 1 . This is because what input port xj 
receives depends on two things: first, the sources encoding 
operations, represented by A; second, the network connec- 
tivity and coding operations performed within the network, 
represented by (J — F)^ 1 . The same argument applies to the 
output ports yj. Thus, the columns of A(I — F)' 1 is equal 
to the coding vectors at the corresponding input/output ports 
in the network. 

We refer to this coding operation in Equation (PTol as "for- 
ward coding". Once the coding vector yj of the output port 
is determined, we can multiply it by the coding coefficient kj . 
We refer to this step as "virtual coding". The "virtual coding" 
can be incorporated into the "forward coding". However, we 
separate the coding into two distinct phases for purposes of 
presentation. 

For an input port x( , let y 1; • • • , y p be the coding vectors 
of the output ports in the set P(xY). Then, the coding vector 
of the input port xj is given by 

v 

where kj G ¥ q are the virtual coding coefficients. Note 
that only a single coefficient kj is chosen for all ports in 
C(yj), which requires that the supernodes in the same layer 
coordinate locally when determining kj's. 

Definition 10.2 (Cut of ports): Consider the coding 
scheme, which assigns coding coefficients in a topological 
order, as mentioned above. Let t be the time index. We denote 
Ct = (S, S c ) to be the current cut of the algorithm, where 
S is the set of ports whose coding coefficients have been 
determined, and S c the set of remaining ports. An output port 
yj is in S if the coding coefficient /3( x y y y^ of its supernode 

V have already been determined. An input port x( is in S 
if all of the virtual coefficients kj of output ports in P(xJ) 
have been determined. The input ports of the source are in S. 

A cut of ports Ct is not necessarily a cut of supernodes. 
In a cut of ports, ports of the same supernode can be in two 
different parts of the cut Ct- We do, however, restrict ourselves 
to a specific type of cuts of ports - all the input ports of a 
specific supernode are on the same side of the cut, and all the 
output ports are on the same side of the cut. 




Fig. 18: An example of Qc t (in black). 



Definition 10.3 (Boundary set): A subset of ports Qc t is a 
boundary set if 

Qc t = {Input ports in S with outputs in S c } U 

{Output ports in S with edges outgoing to S c }. 

Since the topological order proceeds from top to bottom at 
each layer, if the boundary set contains the input ports of a 
certain supernode V at layer I, then it will also contain the 
input ports of the supernodes that are above supernode V at 
layer I. Similarly, if the boundary set contains the output ports 
of a certain supernode V then it will also contain the output 
ports of the supernodes that are above it at layer I. Figure [T"8l 
shows an example of a boundary set, Qc t . 

Lemma 10.1: R<n< \Qc t \ _■ nNi ayer . 

Proof: Consider a cut of supernodes where source 
supernode il = {S} and fl c = V \ {$}. Since S has n output 
ports, we conclude that R < n since R is upper bounded by 
the min-cut. By definition, the output ports of a supernode 
V are restricted to be on the same side of the cut Ct- The 
same is true for the input ports of a supernode. It follows that 
there are at least n ports in Qc t - i.e. n < \Qc t \- Since the 
network is assumed to be layered, and the maximal number of 
supernodes in a layer is Ni ayer , it follows from the definition 
of Q Ct that \Q Ct \< nN layer . ■ 

The code construction considers each subset of R ports in 
Qc t - Some of the subsets we shall consider are regular sets. 
Define Ct by 

Ct = {Regular subsets of Qc t of size R}. (18) 
The number of subsets in Ct is upper bounded by 

w<^<( nN T r } (19) 

Code Invariant: Ensure that at each stage of the code 
construction, each subset in Ct is associated with linearly 
independent coding vectors. 

Lemma 10.2: Maintaining the invariant of the algorithm is 
sufficient to ensure the decodability of the code at rate R. 

Proof: For a (non- virtual) sink Tj G T, let W C 0{Tj) 
be the R upper output ports of Tj . By the definition of R, we 
have mincut(S,Tj) > R. We connect the ports in W to a 
virtual sink T(W) with R edges, where the zth output port of 
Tj is connected to the zth input port of T{W). It follows that 
R = mincut(S,T(W)). Therefore, the set W is regular. 

The invariant ensures that the set W will be associated 
with linearly independent coding vectors. It follows that Tj 
will be able to decode the data, as required by the code. The 
same argument can be applied to all nodes in V(R). The 
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linearly independent vectors of the regular sets can be used 
to reconstruct the data of the source by matrix inversion Q. ■ 
As the algorithm proceeds, ports may leave or enter Qc t - 
The list Ct is then updated accordingly, as we shall discuss in 
the following sections. 

C. Algorithm Description 

The algorithm starts from the upper R input ports of the 
source S with the standard basis as their coding vectors. The 
lower input ports of the source have the zero vector as their 
coding vectors. The source bits are mapped into the source 
symbols in the field F„, where q = 2 k for some k. The 
transmission is over block length k = Iog|FJ. The vector 
of source symbols is u = (ux, ■ ■ ■ , ur) t , where Ui 6 F,. The 
symbol received by a port with coding vector x G F^ is x T u. 

Trivially, the invariant of the algorithm is maintained for 
the source S. At each step of the algorithm, we proceed to the 
next port in the topological order, determine and update the 
followings. 

1) The coding coefficients for the new port (and thus the 
coding vectors). 

2) The updated list C t according to the new cut Ct- 

To do so, we shall treat the input and the output ports 
separately, as discussed in the subsequent sections. For each 
layer, the coding coefficients for the input ports have to be 
determined before the coding coefficients for the output ports. 
We start by considering coding for the output ports, assuming 
that the coding vectors at the input ports are given. 

1) Coding for Output Ports: Assume that at time t, the 
topological order has reached supernode V. For the output 
ports, we assume that in the topological order, yj precedes 
yX ^ J — k- Consider a certain subset W G Ct. Some of 
the ports in this subset can be input ports and some of them 
output ports, as is the case in Figure [18] This can occur if 
the topological order has already reached the output ports of 
several supernodes in the layer, while other output ports at the 
same layer have not yet been reached. Suppose that the ports 
in a subset of the list are given by W = {u>i, • ■ ■ , wr] and 
their coding vectors are given by 



W = {wi, ■ • • ,W H }. 



(20) 



If the set W contains p > 1 input ports from I(V), then 
the subset W has to be updated. After the coding of the 
output ports of supernode V, the input ports in I(V) will 
be replaced by p of the output ports from 0(V). Without loss 
of generality, assume that the p ports in W that are also in 
I(V) are {wx, ■ ■ ■ ,w p } = {x\ , ■ ■ ■ ,xY}. We choose a set of 
size p from 0(V) and denote the set by {w[, • • • , w' p }. There 
are (™) such possible sets. 

We now update list C t to Ct+x- In Ct+i, we replace W = 
{wi, ■ ■ ■ ,w R } with W = {w[, ■ ■•w' p ,w p+ i, ■ ■ ■ ,wr} for all 
(™) choices of {w[,--- ,w' p }. In order for the invariant to 

'The coding vectors at the edges incoming into each sink can be made 
known to the sink by the source transmitting the identity matrix. Thus, the 
sink is informed of the matrix which it must invert for decoding. This idea is 
similar to the one used for network coding 1311 . 



be maintained, we require the coding vectors of all (™) new 
subsets to be simultaneously linearly independent. 

Lemma 10.3: Consider a subset W G Ct, which contains 
p > 1 input ports from I(V). If the field size q > 2, then 
there exists a set of coding coefficients f3f x y y y\ G ¥ q for 
I < i < n, 1 < j < n such that the coding vectors of a subset 
W' E Ct+i are linearly independent. 

Proof: The coding vectors of W' are 



w '= E%,^+ v i' 



i=l 



W p+ i, ■ • • ,W R 



where Vj, i = 1, • • ■ ,p, are the contributions of the coding 
vectors of the input ports in I(V) \ W. The v,'s are assumed 
to be fixed. Since W G Ct, invoking the inductive hypothesis, 
the set of vectors W = {x^, ■ ■ ■ , , w p +i, • • ■ , w^} is a 
basis. We need to determine under which conditions the subset 
W is also a basis. 
Consider the equation 




In order for W to be a basis, we find a sufficient condition 
for (f \ = (f2 = • ■ ■ = <Pr = to be the only solution to ( TSTI ). 
First, we express v, in the basis W as 



7M x i 



v 



7p+i,» w p+i 



where 7 are not all zeros. Substituting and rearranging the 
terms of (Bil l yields 

fiWxY.yY + 7ia) + • • ■ + <p p (f3 x v iy v + 7i, p ) 

+ <Pl(PxV,yY + 7p,l) + • • • + Vp{PxV, y v + 7p,p) 

+ [<PiJ P +i,i H h <P P 7p+i,p + ¥p+i] w p+i H 

+ [<PiJr,i H h <^p7fl,p + <Pr] Wfl = 0. (22) 

Since W is a basis, it follows that 

<Pi(PxY,vY +7i,i) H 1" <Pp(PxY, y v +7i,p) = 

<Pl(PxV,yY + 7p,i) + • ■ • + <P P {PxY, y v + 1p,p) = (23) 
This can be written in matrix form as 
(PxY,yY + ■■■ PxY,yY +7i,p\ /cpA /0 N 



\PxY,yY +7 P , 



•)Y + 7p,p/ xPp 




(24) 



,0y 



We note that <pi = • ■ ■ = cp p = is the only solution of 
( 1211 ) if and only if the matrix in Equation d24T i is non-singular. 
For a p xp matrix over a field F g , the total number of matrices 
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is q p . Using a combinatorial argument, the number of non- 
singular matrices is 



(q p - l)(q p - q)(q p - q 2 ) ■ ■ ■ (q p 
1 \ ( . 1 

V p 



1 



1 



7 p-l 



- 1 ) (25) 

--W(i-V. 

qj q 

Equation d25l l can be explained as follows. For the first column 
of the matrix, we can choose any vector, except the zero vector. 
There are q p — 1 such vectors. For the second column, we 
can choose any vector except any multiple of the first column 
(which includes the zero vector). Thus, there are q p — q choices. 
In general, there are q p — q 1 ^ 1 choices for the ith column. 

So far, we have shown the conditions for ipi = ■ ■ ■ = 
ip p = to be the only solution to (124) . If these conditions 
are maintained, then (f2Tb becomes 



Theorem 10.1: The invariant of the algorithm is maintained 
for the output ports. 

Proof: By assumption, the invariant is maintained for the 
set Qc t{iy which contains the input ports of the supernodes in 
layer I. We need to show that the invariant is maintained for the 
set Qt(o)> which contains the output ports of the supernodes 
in layer I, where t(I) < t(0). This follows by induction from 
Lemma 110.41 ■ 

The average complexity of this stage is computed using 
arguments similar to those in [TT31 for the network code 
construction. According to Lemma 110.41 we can choose the 
alphabet size at this stage to be q = 2n( nNl R \ y ° r ). It follows 
from the proof of Lemma 110.41 that the probability of failure 
when the coding coefficients are chosen randomly is upper 
bounded by 



<Arw_r = 0. 



(26) 



(nN lay 

Pf <L^L 



)nq' 



(29) 



The only solution to this relation is tp p+1 = ■ ■ ■ = ip R = 
since the vectors w p +i, ■ ■ • , w# are in the basis W and are 
therefore linearly independent. 

We conclude that if the matrix in (l24l i is non-singular, then 
the vectors in W are linearly independent. If q > 2, then 
the number of non-singular matrices is positive, and we can 

choose the set of coding coefficients /3,v „v, for 1 < i < In our case, the equivalent to the number of sinks |T| in 
n, 1 < j < n, such that the matrix is non-singular. 



Therefore, the expected number of trials until the vectors in 
W form a basis is at most 2. A single layer has at most Ni ayer 
supernodes. The total number of edges connecting input and 
output ports of a certain supernode is n 2 . It follows that the 
total number of edges at the layer is bounded by Ni ayer n 2 . 



Lemma 10.4: If alphabet size q > n 



'nNic 



" j, then there 

'„ for 1 < i < 



exists a set of coding coefficients /3 x y y y £ 
n, 1 < j < n, such that all the subsets in C t +i have linearly 
independent coding vectors simultaneously. 

Proof: The subset W £ Ct contains p > 1 input ports 
from /(V). From ( |25T l, it follows that for a specific subset W 
in £t+i, the number of non-singular matrices is at least 



I--] >_q p 



1-2 



(27) 



where the last inequality follows from Bernoulli inequality 
which holds when p > and q > 1. Thus, the number of 
singular matrices is at most 



lfl5ll is the size of Ct, which is at most ( nAr ^«" J. Therefore, 
similarly to the complexity in lfT5l for network coding, the 
average complexity for a layer is 0(( nNi £ vl "')Ni a y er n 2 R). 
If the total number of layers is A, then the total average 
complexity of finding the coding coefficients of the output 
ports is 0(( nN '^")N layer n 2 RX). 

2) Coding for Input Ports: The coding for the input ports 
is performed jointly over all supernodes in the same layer. 
Assume that the coding coefficients of the output ports of layer 
I have already all been updated according to Section IX-C1I 
We need to update the coding coefficients of the input ports 
of layer I + 1. The list £ t contains ports from layer I only. 
From the list £ t , we choose an arbitrary subset 



W = {wi, ■ ■ ■ ,w R } £ C t 



(30) 



r 



i - 



p 



= pq p - 1 < 



nq 



(28) 



In Ct+i, there are at most (J lNl ^y r ^ newly added subsets. 
For each subset, there are at most nq n _1 choices of a set 
of coding coefficients f3 x v v, for 1 < i < n,l < j < n, 
such that the coding vector associated with the subset is 
linearly dependent. Therefore by the union bound there are at 
most ( nNl R ^ ysr )nq n _1 choices of sets of coding coefficients 
such that at least one of the subsets in Ct+i can have 
dependent coding vectors. The total number of choices of 
coding coefficients is q n . Therefore, if q > n( nNl £ v " r ^, then 
we will have at least a single set of coding coefficients such 
that all the subsets in C t +i have linearly independent coding 
vectors simultaneously. ■ 
We note that for each supernode, the coding vectors of 
the output ports can be viewed as columns of a parity check 
matrix of a Maximum Distance Separable (MDS) code with 
parameters (n, k = p). 



The set W' is a subset of R input ports at layer I + 1. 
Consider the bipartite network that consists of W and the input 
ports of layer I + 1, with edges from ports in W to ports in 
layer I + 1. Let B(W, W) denote the bipartite subgraph with 
vertices (W, W) and edges from ports in W to ports in W'. 
Let G(yy y/i) denote the incidence matrix for B(W, W). If 
G(w,w) is full rank, then we shall show that we can find 
coding coefficients such that the coding vectors of the ports in 
W are linearly independent. If we cannot find a set W' such 
that G^y/y/i) is full rank, then we remove W from the list 
Ct and do not replace it with a new set W' . Nevertheless, we 
shall show that whenever W is regular, we can always find a 
regular W' such that G(w,w r ) ^ s rank. 

In Figure [19] we see the sets W, W . It can be verified 
that R = 3; however, the rank of the incidence matrix, 
rank(G(w,w')) = 2. The set W' is not regular according to 
Definition 110.11 since the upper and the lower ports in W' 
always receive the same symbol. 
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Fig. 19: Example of a non-regular W'. Ports in black are in 
W, and ports in grey are in W. 



Lemma 10.5: If W' is a regular set containing R input ports 
of supernodes at layer I + 1 for some 1 < I < X — 1, then there 
exists a regular set W containing R output ports on supernodes 
at layer I such that the incidence matrix G(w w>\ is full rank. 

Proof: The result follows from |[T3l lfl4l ifTTH Specifically, 
in fi"7l . a path is defined as a disjoint set of edges (ei , ■ ■ ■ , e M ) 
where e\ starts from the source, e M enters a certain sink, and 
Ci enters the same supernode from which e^+i emerges. In 
this proof, we consider the virtual sink T(W') as our sink. 

In ATI , linearly independent (LI) paths are defined. Con- 
sider the subgraph G' of the network G consisting of K paths 
from source 5* to T{W'). The paths are LI if in G' the rank 
of the incidence matrix of any cut is exactly K. We call a set 
of R LI paths the underlying flow F u . It has been shown in 
ifTTl that such an underlying flow F u exists. In F u , we can 
consider the R output ports of layer /, one from each path. 
This set of R ports is guaranteed to be regular, by definition 
of the underlying flow. This set of output ports will be chosen 
as W . The rank(G(^n")) is full rank, again by definition of 
linearly independent paths. Therefore, the two properties of 
the lemma are maintained. ■ 

We note that in our construction we do not need to find 
the edges in F u . The concept of the underlying flow was 
introduced only for the proof of Lemma 110.51 

Lemma 10.6: For a regular set W' , we can find coding 
coefficients such that the coding vectors W are linearly in- 
dependent simultaneously if the alphabet size q > 2{ aNl ^ y "'^. 

Proof: By Lemma 110.51 given a regular set W', there 
exists a regular set W containing R output ports of supernodes 
in layer I such that the incidence matrix G(w,w) is full rank. 
The coding vectors of the output ports in W are given by 

W = {w 1; -.- ,w fl }. (31) 
The coding vectors in the input ports in W' are given by 

W' = {w[,--- ,w' R }. (32) 
The vector is in the form 



R 



(33) 



where kj are the virtual coefficients, and w, is the contribution 
of output ports of layer I that are not in W. The binary 
coefficient g t j is the (i, j)th element of matrix G(w,W')- 

2 These works consider the problem of unicast communication. We can use 
their result for the proof of Lemma 110.51 since we are interested here in a 
single set W' with a corresponding virtual sink T(W'). 



We need to find the conditions on the coefficient kj under 
which the ports in W' have coding vectors which are linearly 
independent. Consider the equation 

tpxvfi H h iJfiwi; = 0. (34) 

Combining with d33l l. and rearranging, 



J =1 



3=1 



= -axvfi a/i m i„w hmi „. (35) 

We can represent vector w, in the basis W as 

w ; = 7 M wi H h jR t iW R . (36) 

Combining with d35l ) and rearranging terms, 

(<PiJi,i-\ h <Pr1i,r + Vigi.ih H h (pRgR t iki)wi 

H h (^i7fl,i H 1" VR1R..R 

+ ipihi iR k R H h <PRgRfik R )vf R = 0. (37) 

Since W is a basis, it follows that 

<Pi7i,i H 1- Vr1i,r + 7i3ia fc i H 1- Vflf-R,i fc i = 

Vi7-R,i H 1" fRlRfi 

+(pig\,Rk R H h <j>Rg R , R k R = 0, 

or in matrix notation, 

/ 7i,i+5i,i^i "■ 7i, -R + gn.iki \ (ipi 



/ON 



\7h.i + di.Rk-R ■■■ 7fl,fl + 9R,RkR/ XVR/ \0 



(38) 



Let H denote the matrix on the left-hand side of (138b - The 
zero vector is the only solution to (l3~8l l if and only if the 
matrix H is full rank. The determinant of the matrix H is 
a polynomial in the parameters {"fi,j,kj,gij, for 1 < i,j < 
R}. Denote the polynomial by Aw,w'(@i,j, kj, hij). When 
all jij = 0, row i of H is equal to the ith row of G(tv,w) 
multiplied by ki. Therefore, the polynomial Aw,w is of the 
form: 

R 

A w , wi (j h j,kj,gij) = det(G(w tW t)) kj + Sfrij, 

3=1 

(39) 

where det(G(vv,w)) 7^ ^ is the determinant of (non-singular) 
matrix G\y,W> an d 6(-) is a polynomial such that the sum of 
the degrees of all the parameters kj, 1 < j < R, is smaller 
than R. It follows that for constant lij,gij, 1 < i,j < R, 
Aw,w is not the zero polynomial. 

The polynomial Avk,h" corresponds to the pair of regular 
subsets (W, W'). We need to find the corresponding polyno- 
mials for all regular W'. Let Vt denote the set of all these pairs 
(W, W). In order for all such sets W to have independent 
coding vectors, we need to assign the coding coefficients such 
that the following polynomial does not vanish to zero. 



p= ii A w<w ,. 

W':3(W,W')eV t 



(40) 
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By definition, P is not the zero polynomial since it is a product 
of nonzero polynomials. Hence, there is a set of coefficients 
kj such that the polynomial does not vanish to zero. 

Next, we discuss how such coefficients kj's can be found. 
Reference J4J proposes an algorithm to find a vector a such 
that a given polynomial P evaluated at a is not equal to 
zero, i.e. P{a) ^ 0. We reproduce this algorithm from |@] 
in Algorithm Q] for completeness. 

Input : A polynomial P in variables £1, * • ' j £«; 

integers i = 1, t = 1 
Output: a = (ai,a 2 , a n ) 

Step 1 : Find the maximal degree d of P in any variable 
£i, and let % be the smallest number such that 

2 l > d; 

Step 2 : Find an element at in F 2 i such that 

P(0k=a t ^Oand letP^P(0k s =a t ; 

Step 3 : If i = n, then halt; otherwise, t <— £ + 1 and go 
to Step 2; 

Algorithm 1: Algorithm to find a such that P(a) ^ 0. 

In our scenario, the maximal degree of each variable in 
^w,w is 1 because of the structure of the matrix. It follows 
that the maximal degree of each variable in P is at most 
^nNiayery Therefore, d = (J lNl ^«^ and we can always choose 
i = \\og( nNl ^ v ' r )~\. It follows that an alphabet larger than 
2 ri°g(" JV ^ ! " ! '')l < 2 (™ A W) will ensure that there exists 
coding coefficients such that W are linearly independent. ■ 

Theorem 10.2: The invariant of the algorithm is maintained 
for input ports. 

Proof: We prove the theorem by induction. For the base 
case, consider the R upper output ports of the source S. We 
assign the standard basis as coding vectors for these R output 
ports. We then apply Lemmas 110.51 and 110. 6l to the first layer. 

For the inductive step, assume that the statement holds for 
Qt, where Q t contains the output ports of the supernodes in 
layer I. Now, we show that the invariant is maintained for 
Qt+i, which contains the input ports of the supernodes in layer 
I + 1. If W is a regular subset, then by Lemma [10.61 there 
is a coding assignment (according to the code construction 
presented) such that vectors in W are independent. Therefore, 
the invariant is maintained also for layer I + 1. ■ 

We now analyze the complexity. For each pair of sets 
(W, W), we need to verify whether the incidence matrix 
G(w,w) °f tne bipartite graph B(W, W) is full rank. This 
can be performed by determinant computation in complexity 
0(R 3 ). Therefore, the total complexity of this stage for a 
single layer is 0(R 3 ( nA W-)). 

For each variable of P, we require an iteration of the 
Algorithm [TJ Each iteration of Algorithm [1] takes at most 
assignments need to be verified. There are at 
most Ni ayer supernodes at each layer. Therefore, the maximal 
number of output ports, which is also the number of variables 
kj, is nNi ayer . It follows that the complexity of the coding for 
input ports for a single layer is 0(( nN '^ y ' 7 ') (nNi ayer + R 3 ). 
Therefore, if we consider all A layers, the complexity for the 



coding for the input ports is 0{{ nNl ^ v " r )\(nNi ayer + R 3 )). 
Combining the complexity for both input and output ports, 
it follows that the total complexity of the algorithm is 

0(( nN ^'")N layer n 2 XR). 

XI. Conclusions 

ADT networks 1 1 1[2| have drawn considerable attention for 
their potential to approximate the capacity of wireless relay 
networks. In this paper, we showed that the ADT network 
can be described well within the algebraic network coding 
framework J4). This connection between ADT network and 
algebraic network coding allows the use of results on network 
coding to understand better the ADT networks. 

In this paper, we derived an algebraic definition of min- 
cut for the ADT networks, and provided an algebraic in- 
terpretation of the Min-cut Max-flow theorem for a single 
unicast/mulciast connection in ADT networks. Furthermore, 
by taking advantage of the algebraic structure, we have shown 
feasibility conditions for a variety of set of connections C, 
such as multiple multicast, disjoint multicast, and two-level 
multicast. We also showed optimality of linear operations for 
the connections listed above in the ADT networks, and showed 
that random linear network coding achieves the capacity. 
Furthermore, we extended the capacity characterization to 
networks with cycles and random erasures/failures. We proved 
the optimality of linear operations (as well as random linear 
network coding) for multicast connections in ADT networks 
with cycles. By incorporating random erasures into the ADT 
network model, we showed that random linear network coding 
is robust against failures and erasures. 

Taking advantage of this insight, we proposed an efficient 
linear code construction for multicasting in ADT networks 
while guaranteeing decodability. The average complexity of 
the construction is 0(( nNl £ y ' r )Ni ayer n 2 \R). The required 
field size is at most q = n(" l ^ v ° r )', thus, the block length 
required to represent a symbol is at most k = log(n( Ar!a * c '' n )) . 
Our code construction does not require finding network flows 
or knowing the exact location of the sinks. When normalized 
by the number of sinks, our code construction has a complexity 
which is comparable to those of previous coding schemes for 
a single sink. A possible direction for future research is to 
use our construction to find new coding schemes for practical 
multiuser networks with receiver noise. 
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